Reinforcement theory

Results: 290



#Item
191Dynamic programming / Markov processes / Stochastic control / Network theory / Markov decision process / Reinforcement learning / Symbol / Algorithm / Shortest path problem / Statistics / Mathematics / Applied mathematics

Hierarchical Solution of Large Markov Decision Processes Jennifer Barry and Leslie Pack Kaelbling and Tom´as Lozano-P´erez MIT Computer Science and Artificial Intelligence Laboratory Cambridge, MA 02139, USA {jbarry,lp

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2010-05-17 16:00:47
192Dynamic programming / Markov processes / Stochastic control / Network theory / Markov decision process / Reinforcement learning / Symbol / Algorithm / Shortest path problem / Statistics / Mathematics / Applied mathematics

Hierarchical Solution of Large Markov Decision Processes Jennifer Barry and Leslie Pack Kaelbling and Tom´as Lozano-P´erez MIT Computer Science and Artificial Intelligence Laboratory Cambridge, MA 02139, USA {jbarry,lp

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2012-06-11 20:17:02
193Mathematics / Nash equilibrium / Strategy / Solution concept / Minimax / Best response / Matching pennies / Q-learning / Reinforcement learning / Game theory / Problem solving / Decision theory

Playing is believing: The role of beliefs in multi-agent learning Yu-Han Chang Artificial Intelligence Laboratory Massachusetts Institute of Technology

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:51
194Computing / Markov processes / Markov models / Equations / Mathematical optimization / Markov decision process / Reinforcement learning / Automated planning and scheduling / Bellman equation / Statistics / Dynamic programming / Control theory

Toward Hierachical Decomposition for Planning in Uncertain Environments Terran Lane and Leslie Pack Kaelbling MIT Artificial Intelligence Laboratory Cambridge, MA, 02139 USA terran,lpk @ai.mit.edu

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2004-07-01 07:47:51
195Game theory / Cybernetics / Machine learning / Search algorithms / Learning / Reinforcement learning / Markov decision process / Multi-armed bandit / Algorithm / Statistics / Mathematics / Applied mathematics

Hedged learning: Regret-minimization with learning experts Yu-Han Chang [removed] CSAIL, Massachusetts Institute of Technology, 32 Vassar Street, Cambridge, MA[removed]USA Leslie Pack Kaelbling

Add to Reading List

Source URL: people.csail.mit.edu

Language: English - Date: 2005-11-02 21:38:45
196Systems science / Youla–Kucera parametrization / Adaptive control / Optimal control / Nonlinear control / Model predictive control / Kalman filter / Robust control / Automatic control / Control theory / Systems theory / Cybernetics

Feedback Controller Parameterizations for Reinforcement Learning John W. Roberts Ian R. Manchester

Add to Reading List

Source URL: groups.csail.mit.edu

Language: English - Date: 2011-02-22 01:10:59
197Systems theory / Markov processes / Stochastic control / Equations / Mathematical optimization / Markov decision process / Reinforcement learning / Automated planning and scheduling / Bellman equation / Statistics / Dynamic programming / Control theory

Scaling Up Decentralized MDPs Through Heuristic Search Jilles S. Dibangoye Christopher Amato INRIA Computer Science and AI Laboratory

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2013-03-11 16:08:28
198Dynamic programming / Stochastic control / Mathematical sciences / Markov processes / Partially observable Markov decision process / Markov decision process / Reinforcement learning / Algorithm / Mathematical optimization / Statistics / Control theory / Operations research

Producing Efficient Error-bounded Solutions for Transition Independent Decentralized MDPs Jilles S. Dibangoye Christopher Amato

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2013-03-11 16:10:29
199Reinforcement learning / Parametrization / Statistics / Estimation theory / Statistical theory / Coordinate systems / Dimensional analysis / Measurement

Transfer Learning by Discovering Latent Task Parametrizations George Konidaris MIT CSAIL Cambridge, MA[removed]removed]

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-11-30 15:08:20
200Fourier analysis / Numerical analysis / Linear algebra / Joseph Fourier / Spectral theory / Fourier series / Proto-value functions / Gibbs phenomenon / Basis function / Mathematical analysis / Mathematics / Algebra

Value Function Approximation in Reinforcement Learning using the Fourier Basis George Konidaris1,3 1 MIT CSAIL [removed]

Add to Reading List

Source URL: lis.csail.mit.edu

Language: English - Date: 2012-06-08 19:46:17
UPDATE